Exploring the use of Machine Translation resources for English-Japanese Cross-Language Information Retrieval

نویسندگان

  • Gareth Jones
  • Tetsuya Sakai
  • Nigel Collier
  • Akira Kumano
  • Kazuo Sumita
چکیده

This paper describes a detailed investigation into the use of Machine Translation techniques for query translation in English-Japanese Cross-Language Information Retrieval (CLIR). Experimental results are reported for the standard BMIR-J2 Japanese text retrieval collection extended to a CLIR task. Results indicate that retrieval performance increases as greater linguistic processing is used in the translation process, and that local feedback via term reweighting and query expansion is e ective for this task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Indonesian-Japanese CLIR Using Only Limited Resource

Our research aim here is to build a CLIR system that works for a language pair with poor resources where the source language (e.g. Indonesian) has limited language resources. Our IndonesianJapanese CLIR system employs the existing Japanese IR system, and we focus our research on the IndonesianJapanese query translation. There are two problems in our limited resource query translation: the OOV p...

متن کامل

Coling • Acl 2006

Our research aim here is to build a CLIR system that works for a language pair with poor resources where the source language (e.g. Indonesian) has limited language resources. Our IndonesianJapanese CLIR system employs the existing Japanese IR system, and we focus our research on the IndonesianJapanese query translation. There are two problems in our limited resource query translation: the OOV p...

متن کامل

Comparing Multiple Methods for Japanese and Japanese-English Text Retrieval

The NACSIS collection of Japanese scienti c documents (with English titles) provides a solid foundation for information retrieval research into 1) segmentation methods for Japanese text, 2) e ective methods for monolingual Japanese retrieval, and 3) JapaneseEnglish cross-language retrieval. This paper compares multiple methods for Japanese and Japanese-English text retrieval. Our focus is on ac...

متن کامل

CMU in Cross-Language Information Retrieval at NTCIR-3

We participated in the Cross-Language Information Retrieval evaluation at NTCIR-3 for the EnglishChinese and English-Japanese tasks. We examined several approaches to query translation, including the use of a commercial machine translation system, a thesaurus that is automatically extracted from a parallel corpus, and a general-purpose online dictionary. The MT-based approach was most effective...

متن کامل

How Similar are Chinese and Japanese for Cross-Language Information Retrieval?

For NTCIR Workshop 5 UC Berkeley participated in the bilingual task of the CLIR track. Our focus was on Chinese topic searches against the Japanese News document collection, and on Japanese topic search against the Chinese News Document Collection. Extending our work of NTCIR 4 workshop, we performed search experiments to segment and use Chinese search topics directly as if they were Japanese t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999